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We report the draft genome sequence of a Staphylococcus aureus NCTC8325 derivative, strain HG003. HG003 contains func- 
tional global regulators rsb 17 and tcaR and is therefore considered as a reference for studies of regulation and virulence. The ge- 
nome is composed of 2,797,898 bp and will be essential for subsequent RNAseq analysis. 
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Staphylococcus aureus is a serious opportunistic human patho- 
gen responsible for food poisoning, skin infections, and multi- 
resistant nosocomial infections (1). Strain HG003 is considered as 
a reference model by the scientific community, with functional 
rshU axid tcaR global regulators (2). However, while its parental 
strain NCTC8325 was isolated in 1960 from a sepsis patient (3) 
and its genome sequenced in 2001 (4), strain HG003 was gener- 
ated with molecular tools only in 2010 (2). Staphylococci have 
high genomic plasticity (i.e., by recombination and by elevated 
mutation frequency) (5-8). Therefore, there is a need to obtain an 
accurate genome sequence of this reference strain, especially for 
those working on S. aureus physiology and virulence, and also 
because there is growing interest in high-throughput RNA se- 
quencing. Here, we present the draft genome of reference strain 
HG003. 

Genomic DNA was isolated from strain HG003 grown on BHI 
agar plate at 37°C using the Wizard genomic DNA purification kit 
(Promega) following the manufacturer recommendations for an 
efficient lysis of S. aureus. Subsequently, genomic DNA was pre- 
cipitate with sodium acetate and washed two times with ethanol 
70% (vol/vol). DNA was sheared using a Covaris M220 to obtain 
a peak at 600 pb. Unwanted small and large fragments were re- 
moved by size-selection using AMPure XP beads (Beckman 
Coulter). DNA library was prepared using the NEBNext ultra 
DNA library prep kit for lUumina (NEB) and sequenced as paired- 
end reads using an lUumina MiSeq platform and a MiSeq reagent 
kit v3 (600 cycles) (lUumina, Inc., San Diego, CA). The run gen- 
erated 1,826,904 reads (548 Mb of sequence data) with 95 X cov- 
erage. 

lUumina reads, trimmed using Trimmomatic (9) and quality 
filtered using Fastx-toolkit (http://hannonlab.cshl.edu/fastx 
_toolkit/), were assembled using the Spades software (10, 11). 

These analyses yielded 19 scaffolds of 20 contigs containing 
2,797,898 bp, for a 32.74% G+C content, representing 99.14% of 
the completed genome using its parental strain S. aureus subsp. 
aureus NCTC8325 as a reference. We first verified that rshU and 
tcaR genes were actually restored in the clone sequenced. How- 
ever, the average nucleotide identity between the two strains is of 



99.97% suggesting that other mutations are present in strain 
HG003 compared to NCTC8325. 

The genome sequence was annotated by the NCBI Prokaryotic 
Genomes Annotation Pipeline (PGAP) (12). The genome com- 
prises 2,647 genes, 11 predicted RNAs (5S, 16S, and 23S), 50 
tRNAs, one transfer-messenger RNA (tmRNA). Among the 2,647 
genes, 2,573 (97.2%) were found to encode putative proteins and 
488 (18.96%) were assigned as hypothetical proteins. The genome 
encodes three intact Staphylococcus complete phages and four 
questionable clustered regularly interspaced short palindromic 
repeats (CRISPRs), according to the PHAge Search Tool (Phast) 
(13) and the CRISPERfinder online software (14). 

Nucleotide sequence accession numbers. This whole-genome 
shotgun project has been deposited at DDBJ/EMBL/GenBank un- 
der the accession no. JPPUOOOOOOOO. The version described in this 
paper is version JPPUOIOOOOOO. 
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